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Abstract. A sample of 197 long BATSE GRBs is studied statistically. In the sample 11 variables, 
describing for any burst the time behavior of the spectra and other quantities, are collected. The 
application of the factor analysis on this sample shows that five factors describe the sample satisfac- 
torily. Both the pseudo-redshifts coming from the variability and the Amati-relation in its original 
form are disfavored. 
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INTRODUCTION 

Factor Analysis (FA) and the Principal Component Analysis (PCA) are powerful statis- 
tical methods in the data analysis. [2] showed that the 9 variables (T50, T90, P54, P256> 
^1024> ^1,^2,^3 and J^t) of the BATSE GRBs can be satisfactorily represented by 
3 hidden statistical variables. [3] studied the statistical properties of 197 long BATSE 
GRBs, using 10 statistical variables describing the temporal and spectral properties of 
GRBs. Performing a PCA they concluded that about 70 % of the total variance of the 
parameters were explained by the first 3 Principal Components (PCs). 

FA assumes that the observed variables can be explained as a linear combination of 
hidden variables as given by: 

X = Af + £, (1) 

where x marks an observed variable of p dimension, A is a matrix ofpxm dimension 
(m < p) and / means a hidden variable of m dimension. The components of A are called 
loadings and those of / factor scores; £ is a noise term. Observation yields x while the 
quantities on right-hand-side of Eq.l have to be computed by a suitable algorithm: here 
we use the Maximum Likelihood (ML) method. An interesting property of the A matrix 
of factor loadings is that, after undertaking an orthogonal transformation (rotation) on 
it, one gets an other possible factor solution. Rotation is often useful to get a solution, 
which is much easier to interpret: we use the varimax rotation in our calculations. 



TABLE 1. Factors coming from FA with five factors SSload 
is the sum of the squares of the loadings; PropVar defines 
the proportion of SSload to the sum of variances of the input 
variables; CVar defines the sum of proportional variances. 
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THE SAMPLE 

Here we apply the FA on the same sample of 197 long GRBs investigated by [3]: for 
each burst we use the following 11 variables: duration time Tgo, emission time £^o, 
autocorrelation function (ACF) half-width t, variability V, emission symmetry J^jr, 
cross-correlation function time lag Ti ag , the ratio of peak energies ME p ^, peak flux on 
1024 ms scale Fi s , fluence peak energy E^, and low frequency spectral index a. 

It is worth mentioning here that, similarly to [3], we do not consider the fluence on the 
highest channel (> 300 keV) separately, although in [2] this variable alone defined a PC 
(factor). This choice is motivated by two arguments: first, because usually the fluences 
on the fourth channel are often vanishing or have great errors ("the values are noisy"); 
second, as it is noted by [3], in a sample given by long-soft GRBs only, this quantity is 
less important. 

RESULTS AND DISCUSSION 

The m number of factors, which satisfactorily reproduce the original correlation matrix, 
can be constrained ([7]) by the inequality of m < (2p + 1 — ^Sp+ l)/2 , which here 
gives m < 6.782. Since the number of factors is an integer, m = 6 is the maximum value. 

The ML method used here gives also a probability of the null hypothesis, i.e., that the 
correlation matrix of the observed variables and that reproduced by the factor solution 
are statistically identical. For 4, 5 and 6 factors, we get for the validity of the null hypo- 
thesis the probabilities of 0.000384, 0.0872, and 0.0973, respectively. These calculations 
show that 5 factors are already sufficient. The choice of 5 factors can be supported from 
the cumulative variances too. 



The first factor is defined by T, £%o, TgQ, and & , i.e., the first factor is given mainly 
by the temporal properties. The measures x and £^q are preferred over TgQ. 

The second factor is given mainly by Fi s and Hence, the second factor is related to 
the observed strength of the burst. The loadings of Ti ag and V are negligible, and hence 
there is no direct support for the luminosity estimators based on these two variables 
([11,12,10,6]). 

The first two factors are in accordance with [2] claiming that - among fluence, peak 
flux and duration - two principal components or factors exist. 

The third factor is mainly driven by E p ^. It is interesting that this peak energy 
(break-energy) in the spectra is appearing so dominantly as a significant variable in the 
third factor. It emphasizes that the spectrum itself is an important quantity (a trivially 
expectable result), and in the spectrum itself is a significant descriptor (this is not 
a triviality). The loadings of logTi ag and log are also important in the third factor. It 
means that there is some connection of X\ ag with the emitted energies of GRBs, and thus 
the luminosity indicator based on the spectral lag seems to be indirectly supported by 
the structure of third factor ([10, 13, 14]). If the Amati-relation ([1]) stands then there 
should be a linear connection between logEp^-i^^ and log Ei so . The Amati-relation 
was indeed predicted by the strong correlation between log^ and log£p k ([8]). The 
correlation between log^ and logitpk does not mean that there is a linear connection 
only between logE^ and logZspkjintrinsic- In fact, [3] arrived also to the conclusion that 

log E iso = a I log£ p k;intrinsic + b\ log ^intrinsic + ci (2) 

should hold with some suitable ai,b\,c\ constants {^intrinsic = +z)). Note that ^50 
and T strongly correlates with each other, i.e., in this equation either Xi ntr or ^so-jntrinsic 
can be used. Recently, the validity of the Amati-relation has been a matter of intensive 
discussion ([14, 9, 5, 4]). The factor loadings show that log^ is explained basically by 
the first three factors. Since Fact. 1 is mainly given by log x, Fact. 2 by logFi s and Fact. 
3 by log£pk, all this suggests that a relation of the form 

log + b 2 log x intr + c 2 log L iso + d (3) 

should exist, with some suitable a2,^2,C2,<i constants. Note that a similar relation was 
proposed also by [5]. It follows from the first three factors that the relationship of log & 
and logitpk is less important than that of the variables dominating Fact. 1 and 2, because 
log and logiipk together are mainly determined by Fact. 3, and thus for their relation 
one cannot omit the variables that are dominating Fact. 1 and Fact. 2, respectively. This 
fact disfavors a simple linear relationship only between logEp^intrinsic an ^ logiw. 

The fourth factor is dominated by V, S^g and ME V ^. However, according to [11] 
and [12], the variability should be coupled to the luminosities of GRBs, and hence to 
the fluence and peak flux. No such connection is supported by the fourth factor. Hence, 
some queries emerge here for the redshift estimations derived from the variability. 

The fifth factor is dominated by Tgo and ^o- This shows that TgQ and are not 
completely equivalent, though ^ better characterizes a burst. 



CONCLUSIONS 



The results of the paper may be summarized as follows. 

• No more than 5 factors should be introduced. This essential lowering of the signif- 
icant variables is the key result of this paper. 

• The structure of factors is similar to the PCs of [3]. The number of important 
quantities is more accurately defined here. 

• The first factor is given mainly by the temporal variables, and the quantities x and 

are preferred. 

• The second factor is related to the strength of the burst. 

• The connection of £p k in the third factor with other quantities, and the structure 
of the first three factors cast considerable doubts about the Amati-relation in its 
original form. For the luminosity indicators based on the spectral lag some support 
emerges from the third factor. 

• The variability in fourth factor does not support its connection to the intrinsic 
luminosities, and the pseudo-redshift estimations based on the variability. 

• The fifth factor shows that T 9 q and are not completely equivalent. 

Because all these conclusions are obtained from the measured data alone, all models 
of long GRBs must respect these expectations. 
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